CodeClone 2.1.0a1 — Structural Change Controller, Engineering Memory, and native agent integrations by orenlab · Pull Request #37 · orenlab/codeclone

orenlab · 2026-06-17T18:08:10Z

Summary

Opens the CodeClone 2.1 alpha line. Moves CodeClone from a read-only
structural analyzer to an intent-first structural change controller for
AI-assisted Python development, and adds Engineering Memory, trajectory and
experience layers, semantic retrieval, opt-in Platform Observability, native
agent integrations, offline Corpus Analytics, and a reorganized docs site.

Highlights

Structural Change Controller — start_controlled_change /
finish_controlled_change cut the governed agent edit cycle from 7–11 MCP
calls to 3–4 (workspace check → intent → blast radius → bounded scope → patch
verify → claim validation → deterministic receipt). 33 agent-visible MCP tools.
Live Implementation Context — get_implementation_context projects
bounded structural / call-graph / contract evidence from one stored run;
per-function relationship facts off the canonical report (cache schema 2.9 → 2.10),
with separate context-artifact and projection digests. Evidence never authorizes
edits; edit_allowed stays authoritative.
Change-intent lifecycle & multi-agent coordination — manage_change_intent
(declare / check / clear / queue / promote / recover), renewable leases,
optional SQLite coordination, workspace hygiene, recoverable-intent handling.
Engineering Memory — local SQLite knowledge graph of typed, evidence-linked
facts; get_relevant_memory / query_engineering_memory; drafts stay
human-governed (CLI / VS Code Memory view). Never authorizes edits or overrides
the report, gates, or Patch Trail.
Trajectory Memory + Patch Trail + Experience Layer — audit-derived workflows
with quality passports, complexity scoring, anomaly detection; advisory
experience patterns (Engineering Memory schema 1.7).
Semantic retrieval — optional LanceDB hybrid FTS5/BM25 + vector via
deterministic Reciprocal Rank Fusion; local embeddings via
codeclone[semantic-local]; lazy, failure-tolerant, eventually consistent.
Platform Observability — opt-in, development-only telemetry of CodeClone's
own runtime (timings, RSS/CPU, MCP payload/token pressure, DB query shapes,
causal worker chains, costly no-ops); JSON/HTML cockpit + bounded
query_platform_observability. Never affects reports, gates, memory, or auth.
Native agent integrations — VS Code (memory governance, trajectory
dashboards, controller audit, session stats), Claude Desktop .mcpb, a
dedicated Claude Code marketplace plugin, Codex, and Cursor (skills, rules,
fail-closed preToolUse enforcement, structural-review agent).
Corpus Analytics — optional offline clustering of historical change-control
intents (codeclone[analytics]) with interpretability, versioned profiles, and
maintainer selection control; artifacts under .codeclone/analytics/.
Docs — reorganized thematic contract book + unified integration guides;
Zensical strict/clean builds.

Changed / migration

Default workspace moved .cache/codeclone/ → .codeclone/ (legacy paths emit a migration warning).
pydantic is now a base dependency.
LCOM4 excludes Protocol methods and Pydantic validation/serialization hooks (computed_field still counts).
Repository test coverage enforced at ≥99%.

Fixed (notable)

Durable memory writes (synchronous=FULL) survive unclean MCP exits; atomic memory ingestion (deferred batch commit).
Semantic retrieval preserves lexical + vector relevance through RRF; per-source vector budgets; coalesced/deferred projection jobs.
Workspace hygiene/intent attribution: finish blocks only on missing evidence or foreign dirty overlap; continue_own_wip resumes owned work.
Patch verification rejects identical before/after runs for structural/governance profiles; negative health deltas surface a regression advisory.
Architecture: blast-radius graph moved to codeclone/analysis/blast_radius.py, removing the CLI→MCP dependency violation.

Compatibility

Cache schema → 2.9 / 2.10 (additive relationship-fact projection; canonical report identity unchanged).
Engineering Memory schema 1.7. Read-only MCP semantics and fingerprint contract unchanged.

Release plan

Merge feat/2.1-alpha → main now to cut the release. Remaining bugs land on a
follow-up fix/* branch off main. Target: release Monday.

Pre-merge checklist

uv run pre-commit run --all-files
uv run pytest -q (coverage ≥99%)
Docs strict build (zensical build --clean --strict)
MCP tool visibility / schema tests (tests/test_mcp_service.py, tests/test_mcp_server.py)

Add concise tool param descriptions via pydantic Field annotations, reject cache_policy=refresh at the MCP boundary, and refresh the tool schema snapshot. Compact agent payloads with next_tool hints, help anti_patterns in compact mode, trust_boundaries topic, and shorter workflow messages. Load golden_fixture_paths from pyproject even when respect_pyproject=false so fixture clones stay suppressed without lowering analysis thresholds. Refactor duplicated_branches hotspots in workspace intents, session intent helpers, and Cursor hooks. Update tests, MCP docs, and changelog.

Centralize MCP tool descriptions, help topics, and workflow strings under codeclone/surfaces/mcp/messages/; update CHANGELOG, AGENTS.md, and docs paths.

…d MCP

Optional sqlite backend keeps closed intents with status transitions and retention purge (7d default, 14d OSS max), pydantic wire validation, schema v2, shared sqlite helpers, and plans/retention docs.

Exclude Protocol stub methods and Pydantic validation/serialization hooks from the LCOM4 graph while keeping computed_field in the graph. Add a one-time CLI note for trusted 2.0.2 baselines and document applicability in metrics docs.

…ators Refactor Pydantic contract guards and gc/schema control flow without changing validation semantics or wire payloads.

Codex plugin instructions and bundled skills now mandate start/finish before edits and require surfacing structural_delta and receipt advisories even when finish is accepted. Align CLAUDE.md and Cursor workflow rule with the pipeline.

Cover scope/integrity validators, SQLite store edge paths, and schema migration branches that were previously untested.

Raise line coverage toward the 99% gate with branch tests for session stats, patch verify, HTML helpers, extractor edges, and pipeline metrics.

Cover scope/integrity validators, SQLite store edge paths, and schema migration branches that were previously untested.

Raise line coverage toward the 99% gate with branch tests for session stats, patch verify, HTML helpers, extractor edges, and pipeline metrics.

Advise adding `.cache/codeclone/` when the repository root `.gitignore` does not cover CodeClone ephemeral state. Surfaces: MCP tips[] on analyze, summary, triage, and start; CLI tip after interactive runs. Shared check lives in codeclone/paths/gitignore.py; neither surface edits `.gitignore` automatically

Split PID, staleness, registry lock, and git-scoped hygiene into leaf modules; dedupe lifecycle predicates and route store GC through staleness.

Add continue_own_wip start policy, after_run_not_new verify guard, health regression advisory, recovery hints, queued-foreign finish hygiene, and patch_health_delta on validate_review_claims.

Document dirty_scope_policy, verify advisories, recovery hints, lazy-close semantics, and patch_health_delta claim-guard wiring across MCP docs and skills.

Use list_workspace_intent_records_for_recovery so expired_count includes TTL-expired recoverable intents instead of filtering them out at list time.

Compose MCP session mixins in session.py instead of serial cross-file inheritance so module import depth drops without changing runtime MRO. Add mypy overrides for composed mixin modules.

Add workspace hygiene and sqlite store lifecycle tests, shared runpy guard helper, global intent-store cache autouse, and coverage for session stats, workflow finish paths, and claim guard behavior.

Fix three documentation-vs-code divergences found during audit: - add changed_paths/git_diff_ref to analyze_repository param table - add missing strictness param to finish_controlled_change table - document all three supported suppression rule IDs Condense the 2.1.0a1 changelog by merging related items and collapsing

Add coerce_repo_path_tuple and coerce_object_dict helpers for cross-mixin calls after session flattening; drop no-any-return mypy override and keep 99% coverage without dead internal guard branches.

Consolidate duplicated continue branches in workspace hygiene into _skip_foreign_dirty_record without changing overlap semantics.

# Conflicts: # tests/test_workspace_intent_sqlite_store.py

Fix schema versions, source_kind filters, workflow tool guidance, and read-only semantics across book chapters, extension READMEs, and plugin manifests after the full documentation audit.

Filter Poetry launcher probe subprocess env like exec path. Open Production Triage fetches get_production_triage with a 5s cooldown and in-flight dedup.

…finish finish_controlled_change validates claims via claims_text while review_text stays a human note; finish responses add summary and workspace hygiene. Workspace intent I/O is thread-safe with correct sqlite closed_at reactivation, shutdown closes audit writers, and worker caches process_file signature lookup.

…s core Move blast-radius graph traversal to codeclone/analysis/blast_radius.py so CLI and MCP share neutral logic without surface-layer import violations. finish_hygiene_check cross-checks the full git tree against finish evidence, blocking under-reported in-scope dirty paths and own unscoped edits while ignoring foreign active/stale intent paths outside declared scope. Sync docs, skills, and CHANGELOG with payload semantics and recoverable nuance.

SQLite store, init ingestion, scoped retrieval, staleness/vacuum, governance CLI, coverage metrics, MCP tools (get_relevant_memory, query_engineering_memory, manage_engineering_memory), finish propose_memory hook, shared file_lock, and baseline refresh for CI.

…ngs bridge

…P tool

…gurable) + honest editions page

github-actions · 2026-06-17T18:09:13Z

CodeClone Review

✅ Passed · Health 91/100 (A) · Baseline ok · Cache miss · CodeClone 2.1.0a1

Review snapshot

Area	Signal	Review note
Clones	0 total, 0 new, 0 known	no new clone debt reported
Quality	CC max 20, CBO max 9, LCOM4 max 3, overloaded 41	structural metric snapshot
Dependencies	avg 6.8, p95 21, max 23, cycles 0	acyclic
Coverage Join	not joined	no coverage.xml facts in this report
Security Surfaces	239 surfaces, 6 categories, 124 production	report-only boundary inventory
API Surface	6722 symbols, 611 modules	0 breaking, 0 added
Dead code	0 high-confidence, 2 suppressed	clean

Review focus

Treat 124 production security surface(s) as review-first boundary code when touched.
Review 41 overloaded module candidate(s) when they intersect this PR.

_{Security Surfaces are report-only capability inventory, not vulnerability claims. Generated by CodeClone}

orenlab added 30 commits May 30, 2026 00:23

fix(mcp): cap process pool size

81ffa91

docs(security): document trust boundaries

e590acf

test: cover MCP process cap and symlink resolve edge paths

c8b2b5f

refactor(mcp): extract agent-facing copy into messages package

2d5c742

Centralize MCP tool descriptions, help topics, and workflow strings under codeclone/surfaces/mcp/messages/; update CHANGELOG, AGENTS.md, and docs paths.

refactor(copy): centralize user-facing strings across CLI, report, an…

c643bd4

…d MCP

feat(mcp): add auditable SQLite workspace intent registry

52a551d

Optional sqlite backend keeps closed intents with status transitions and retention purge (7d default, 14d OSS max), pydantic wire validation, schema v2, shared sqlite helpers, and plans/retention docs.

refactor(mcp): eliminate duplicated_branches in intent registry valid…

af9d3b3

…ators Refactor Pydantic contract guards and gc/schema control flow without changing validation semantics or wire payloads.

test(mcp): expand workspace intent registry validation coverage

247e941

Cover scope/integrity validators, SQLite store edge paths, and schema migration branches that were previously untested.

test: add targeted coverage for CLI, report, and metrics paths

0b58566

Raise line coverage toward the 99% gate with branch tests for session stats, patch verify, HTML helpers, extractor edges, and pipeline metrics.

test(mcp): expand workspace intent registry validation coverage

0ca0c4f

Cover scope/integrity validators, SQLite store edge paths, and schema migration branches that were previously untested.

test: add targeted coverage for CLI, report, and metrics paths

5370c0b

Raise line coverage toward the 99% gate with branch tests for session stats, patch verify, HTML helpers, extractor edges, and pipeline metrics.

refactor(mcp): extract workspace intent leaf modules and scoped hygiene

6aca43a

Split PID, staleness, registry lock, and git-scoped hygiene into leaf modules; dedupe lifecycle predicates and route store GC through staleness.

feat(mcp): harden multi-agent change-control workflow and claim guard

e71afbd

Add continue_own_wip start policy, after_run_not_new verify guard, health regression advisory, recovery hints, queued-foreign finish hygiene, and patch_health_delta on validate_review_claims.

docs(mcp): sync multi-agent change-control and claim guard contracts

bea0e94

Document dirty_scope_policy, verify advisories, recovery hints, lazy-close semantics, and patch_health_delta claim-guard wiring across MCP docs and skills.

fix(cli): count expired intents via recovery listing in session stats

db116bf

Use list_workspace_intent_records_for_recovery so expired_count includes TTL-expired recoverable intents instead of filtering them out at list time.

refactor(mcp): flatten session mixin import chain in session.py

cfb39bb

Compose MCP session mixins in session.py instead of serial cross-file inheritance so module import depth drops without changing runtime MRO. Add mypy overrides for composed mixin modules.

test(mcp): expand Phase 17 coverage and hygiene test contracts

9c72b7f

Add workspace hygiene and sqlite store lifecycle tests, shared runpy guard helper, global intent-store cache autouse, and coverage for session stats, workflow finish paths, and claim guard behavior.

fix(mcp): restore strict no-any-return typing for composed mixins

9223980

Add coerce_repo_path_tuple and coerce_object_dict helpers for cross-mixin calls after session flattening; drop no-any-return mypy override and keep 99% coverage without dead internal guard branches.

fix(mcp): extract foreign dirty overlap skip guard

a5487ed

Consolidate duplicated continue branches in workspace hygiene into _skip_foreign_dirty_record without changing overlap semantics.

Merge remote-tracking branch 'origin/feat/2.1-alpha' into feat/2.1-alpha

1a44cc6

# Conflicts: # tests/test_workspace_intent_sqlite_store.py

docs: align docs and client surfaces with MCP contracts

3384c31

Fix schema versions, source_kind filters, workflow tool guidance, and read-only semantics across book chapters, extension READMEs, and plugin manifests after the full documentation audit.

fix(launcher,vscode): harden poetry probe env and live triage refresh

0bab286

Filter Poetry launcher probe subprocess env like exec path. Open Production Triage fetches get_production_triage with a 5s cooldown and in-flight dedup.

orenlab added 19 commits June 15, 2026 19:38

feat(plugins): add codeclone-implementation-context skill

92f9468

feat(skills): synchronized skills

f67d54c

fix(memory): make persist_batch atomic (audit H1)

561ce6d

fix(audit): count swallowed best-effort failures (audit M9+M10)

25203b2

fix(mcp): compact subject_not_found in get_implementation_context

1058db4

feat(mcp): expose raw module imports off-report (track 2 step 1)

6891f7e

feat(mcp): add graph search via get_implementation_context query param

4ac48c7

docs: align site and agent playbooks with current contracts

f2dab51

fix(memory): align IDE stale-reject with reject_record

4041dbf

fix(memory): tolerate corrupt payload_json on read

baf753a

docs(skills): sync all plugins to one strict 8-skill set + gate→findi…

7b4a5bd

…ngs bridge

chore(tests): extend tests coverage

38940cf

chore(deps): update direct and transitive project deps

4cb89d1

fix(test): isolate fastembed import_module mock in foundation test

5291b8f

chore(vscode): require VS Code 1.120 and bump @types/vscode

f67507f

fix(claude-desktop): sync manifest with get_implementation_context MC…

98f73e2

…P tool

docs: align site and agent playbooks with current contracts

4ba631c

test(memory): include contracts path in init batch repo smoke registry

088b077

feat(core): drop OSS intent-registry retention cap (default 14, confi…

ece558d

…gurable) + honest editions page

orenlab self-assigned this Jun 17, 2026

github-advanced-security AI found potential problems Jun 17, 2026

View reviewed changes

orenlab removed the bug Something isn't working label Jun 17, 2026

orenlab merged commit 3e3a05a into main Jun 17, 2026
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CodeClone 2.1.0a1 — Structural Change Controller, Engineering Memory, and native agent integrations#37

CodeClone 2.1.0a1 — Structural Change Controller, Engineering Memory, and native agent integrations#37
orenlab merged 319 commits into
mainfrom
feat/2.1-alpha

orenlab commented Jun 17, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jun 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

orenlab commented Jun 17, 2026

Summary

Highlights

Changed / migration

Fixed (notable)

Compatibility

Release plan

Pre-merge checklist

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

github-actions Bot commented Jun 17, 2026

CodeClone Review

Review snapshot

Review focus

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants